EraX-VL-7B-V1.5 is a powerful multimodal model specializing in Optical Character Recognition (OCR) and Visual Question Answering (VQA), excelling in multilingual environments with particular expertise in Vietnamese.
Image-to-Text
Transformers Supports Multiple Languages